Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
NAR Genom Bioinform ; 6(1): lqae017, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38486887

RESUMO

Latest advancements in the high-throughput single-cell genome (scDNA) and transcriptome (scRNA) sequencing technologies enabled cell-resolved investigation of tissue clones. However, it remains challenging to cluster and couple single cells for heterogeneous scRNA and scDNA data generated from the same specimen. In this study, we present a computational framework called CCNMF, which employs a novel Coupled-Clone Non-negative Matrix Factorization technique to jointly infer clonal structure for matched scDNA and scRNA data. CCNMF couples multi-omics single cells by linking copy number and gene expression profiles through their general concordance. It successfully resolved the underlying coexisting clones with high correlations between the clonal genome and transcriptome from the same specimen. We validated that CCNMF can achieve high accuracy and robustness using both simulated benchmarks and real-world applications, including an ovarian cancer cell lines mixture, a gastric cancer cell line, and a primary gastric cancer. In summary, CCNMF provides a powerful tool for integrating multi-omics single-cell data, enabling simultaneous resolution of genomic and transcriptomic clonal architecture. This computational framework facilitates the understanding of how cellular gene expression changes in conjunction with clonal genome alternations, shedding light on the cellular genomic difference of subclones that contributes to tumor evolution.

2.
Int J Biol Macromol ; 262(Pt 2): 130131, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38354937

RESUMO

Deleted in breast cancer 1 (DBC1) is a human nuclear protein that modulates the activities of various proteins involved in cell survival and cancer progression. Oxidized form of nicotinamide adenine dinucleotide (NAD+) is suggested to bind to the Nudix homology domains (NHDs) of DBC1, thereby regulating DBC1-Poly (ADP-ribose) polymerase 1 (PARP1) interactions, resulting in the restoration of DNA repair. Using Nuclear Magnetic Resonance (NMR) and Isothermal Titration Calorimetry (ITC), we confirmed NAD+ and its precursor nicotinamide mononucleotide (NMN) both bind the NHD domain of DBC1 (DBC1354-396). NAD+ likely interacts with DBC1354-396 through hydrogen bonding, with a binding affinity (8.99 µM) nearly twice that of NMN (17.0 µM), and the key binding sites are primarily residues E363 and D372, in the agreement with Molecular Docking experiments. Molecular Dynamics (MD) simulation further demonstrated E363 and D372's anchoring role in the binding process. Additional mutagenesis experiments of E363 and D372 confirmed their critical involvement of ligand-protein interactions. These findings lead to a better understanding of how NAD+ and NMN regulate DBC1, thereby offering insights for the development of targeted therapies and drug research focused on DBC1-associated tumors.


Assuntos
Reparo do DNA , NAD , Humanos , NAD/metabolismo , Simulação de Acoplamento Molecular , Sobrevivência Celular , Sítios de Ligação
3.
J Agric Food Chem ; 71(43): 16352-16361, 2023 Nov 01.
Artigo em Inglês | MEDLINE | ID: mdl-37800479

RESUMO

Nonspecific phospholipase C (NPC) plays a pivotal role in hydrolyzing phospholipids, releasing diacylglycerol─an essential second messenger. Extensive research has elucidated the structure and function of bacterial and plant NPCs, but our understanding of their fungal counterparts remains limited. Here, we present the first crystal structure of a fungal NPC derived from Rasamsonia emersonii (RePLC), unraveling its distinguishable features divergent from other known phospholipase C. Remarkably, the structure of RePLC contains solely the phosphoesterase domain without the crucial C-terminal domain (CTD) found in plant NPCs, although CTD is important for their activity. Through a comparative analysis of structural features among NPCs from diverse species combined with structure-based mutation analyses and bioinformatics methods, we propose a potential molecular mechanism that may universally underlie the catalysis of phospholipid hydrolysis in fungal NPCs. Furthermore, our study sheds light on the captivating evolutionary trajectory of enzymes across diverse species.


Assuntos
Fosfolipídeos , Fosfolipases Tipo C , Fosfolipases Tipo C/genética , Hidrólise , Fosfolipídeos/química , Catálise
4.
Front Genet ; 14: 1148470, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36911403

RESUMO

Colon adenocarcinoma is the most common type of colorectal cancer. The prognosis of advanced colorectal cancer patients who received treatment is still very poor. Therefore, identifying new biomarkers for prognosis prediction has important significance for improving treatment strategies. However, the power of biomarker analyses was limited by the used sample size of individual database. In this study, we combined Genotype-Tissue Expression (GTEx) and The Cancer Genome Atlas (TCGA) databases to expand the number of healthy tissue samples. We screened differentially expressed genes between the GTEx healthy samples and TCGA tumor samples. Subsequently, we applied least absolute shrinkage and selection operator (LASSO) regression and multivariate Cox analysis to identify nine prognosis-related immune genes: ANGPTL4, IDO1, NOX1, CXCL3, LTB4R, IL1RL2, CD72, NOS2, and NUDT6. We computed the risk scores of samples based on the expression levels of these genes and divided patients into high- and low-risk groups according to this risk score. Survival analysis results showed a significant difference in survival rate between the two risk groups. The high-risk group had a significantly lower overall survival rate and poorer prognosis. We found the receiver operating characteristic based on the risk score was showed to accurately predict patients' prognosis. These prognosis-related immune genes may be potential biomarkers for colorectal cancer diagnosis and treatment. Our open-source code is freely available from GitHub at https://github.com/gutmicrobes/Prognosis-model.git.

5.
Comput Biol Med ; 157: 106774, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-36931204

RESUMO

Studies have found that different immune subtypes are present in the same tumor. Different tumor subtypes have different tumor microenvironments (TME). This means that the efficacy of immunotherapy in actual applications will, therefore, have different results. Existing tumor immune subtype studies have mostly focused on immune cells, stromal cells, genes and molecules without considering the presence of microbes. Some studies have shown that microflora can strongly promote many gastrointestinal cancers. The microbiome has, therefore, become an important biomarker and regulatory factor of cancer progression and therapeutic responses. In addition, the presence of microflora can strongly regulate the host immune system, indirectly affecting tumor growth. Taken together, it is important to study the relationships that develop among tumor tissue microorganisms, tumor immune subtype, and the TME. In this study, correlations between microbial abundance, immune cell infiltration, immune gene expression and tumor immune subtype were studied. To accomplish this, tissue microorganisms and immune cell ratios with significant differences between the different cancers were obtained by comparing 203 gastric cancer and intestinal cancer samples. Two immune subtypes of intestinal samples were obtained by K-means clustering algorithm and tissue microorganisms, immune cell ratios and immune-related genes with significant differences between different immune subtypes were screened through Wilcoxon rank sum test. The results showed that Clostridioides difficile, Aspergillus fumigatus, Yarrowia lipolytica, and Fusarium pseudograminearum were all closely associated with the identified tumor immune subtypes. Our open-source software is freely available from GitHub at https://github.com/gutmicrobes/IMM-subtype.git.


Assuntos
Neoplasias Gástricas , Algoritmos , Aspergillus fumigatus , Análise por Conglomerados , Imunoterapia , Microambiente Tumoral
6.
Endocr Relat Cancer ; 29(12): 665-679, 2022 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-36165930

RESUMO

High-grade (grade 3) neuroendocrine neoplasms (G3 NENs) have poor survival outcomes. From a clinical standpoint, G3 NENs are usually grouped regardless of primary site and treated similarly. Little is known regarding the underlying genomics of these rare tumors, especially when compared across different primary sites. We performed whole transcriptome (n = 46), whole exome (n = 40), and gene copy number (n = 43) sequencing on G3 NEN formalin-fixed, paraffin-embedded samples from diverse organs (in total, 17 were lung, 16 were gastroenteropancreatic, and 13 other). G3 NENs despite arising from diverse primary sites did not have gene expression profiles that were easily segregated by organ of origin. Across all G3 NENs, TP53, APC, RB1, and CDKN2A were significantly mutated. The CDK4/6 cell cycling pathway was mutated in 95% of cases, with upregulation of oncogenes within this pathway. G3 NENs had high tumor mutation burden (mean 7.09 mutations/MB), with 20% having >10 mutations/MB. Two somatic copy number alterations were significantly associated with worse prognosis across tissue types: focal deletion 22q13.31 (HR, 7.82; P = 0.034) and arm amplification 19q (HR, 4.82; P = 0.032). This study is among the most diverse genomic study of high-grade neuroendocrine neoplasms. We uncovered genomic features previously unrecognized for this rapidly fatal and rare cancer type that could have potential prognostic and therapeutic implications.


Assuntos
Neoplasias Intestinais , Tumores Neuroendócrinos , Neoplasias Pancreáticas , Neoplasias Gástricas , Humanos , Neoplasias Gástricas/patologia , Tumores Neuroendócrinos/patologia , Prognóstico , Genômica , Mutação , Neoplasias Pancreáticas/patologia , Neoplasias Intestinais/patologia
7.
Front Immunol ; 13: 853213, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35493464

RESUMO

Recent transcriptomics and metagenomics studies showed that tissue-infiltrating immune cells and bacteria interact with cancer cells to shape oncogenesis. This interaction and its effects remain to be elucidated. However, it is technically difficult to co-quantify immune cells and bacteria in their respective microenvironments. To address this challenge, we herein report the development of a complete a bioinformatics pipeline, which accurately estimates the number of infiltrating immune cells using a novel Particle Swarming Optimized Support Vector Regression (PSO-SVR) algorithm, and the number of infiltrating bacterial using foreign read remapping and the GRAMMy algorithm. It also performs systematic differential abundance analyses between tumor-normal pairs. We applied the pipeline to a collection of paired liver cancer tumor and normal samples, and we identified bacteria and immune cell species that were significantly different between tissues in terms of health status. Our analysis showed that this dual model of microbial and immune cell abundance had a better differentiation (84%) between healthy and diseased tissue. Caldatribacterium sp., Acidaminococcaceae sp., Planctopirus sp., Desulfobulbaceae sp.,Nocardia farcinica as well as regulatory T cells (Tregs), resting mast cells, monocytes, M2 macrophases, neutrophils were identified as significantly different (Mann Whitney Test, FDR< 0.05). Our open-source software is freely available from GitHub at https://github.com/gutmicrobes/PSO-SVR.git.


Assuntos
Neoplasias Hepáticas , Transcriptoma , Algoritmos , Bactérias/genética , Biologia Computacional , Humanos , Neoplasias Hepáticas/genética , Software , Microambiente Tumoral/genética
8.
HGG Adv ; 3(2): 100085, 2022 Apr 14.
Artigo em Inglês | MEDLINE | ID: mdl-35146455

RESUMO

Identifying women at high risk for developing breast cancer is potentially lifesaving. Patients with pathogenic genetic variants can embark on a program of surveillance for early detection, chemoprevention, and/or prophylactic surgery. Newly diagnosed cancer patients can also use the results of gene panel sequencing to make decisions about surgery; therefore, rapid turnaround time for results is critical. Cancer Risk B (CR-B), a test that uses flow variant assays to assess the effects of variants in the DNA double-strand break repair, was applied to two groups of subjects who underwent coincidental gene panel testing, thereby allowing an assessment of sensitivity, specificity and accuracy, and utility for annotating variants of uncertain significance (VUS). The test was compared in matched peripheral blood mononuclear cells (PBMCs) and lymphoblastoid cells (LCLs) and tested for rescue in LCLs with gene transfer. The CR-B phenotype demonstrated a bimodal distribution: CR-B+ indicative of DSB repair defects, and CR-B-, indicative of wild-type repair. When comparing matched LCLs and PBMCs and inter-day tests, CR-B yielded highly reproducible results. The CR-B- phenotype was rescued by gene transfer using wild-type cDNA expression plasmids. The CR-B- phenotype predicted VUS as benign or likely benign. CR-B could represent a rapid alternative to panel sequencing for women with cancer and identifying women at high risk for cancer and is a useful adjunct for annotating VUS.

9.
Front Immunol ; 12: 665773, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34108966

RESUMO

The COVID-19 pandemic has caused more than three million deaths globally. The severity of the disease is characterized, in part, by a dysregulated immune response. CD16+ monocytes are innate immune cells involved in inflammatory responses to viral infections, and tissue repair, among other functions. We characterized the transcriptional changes in CD16+ monocytes from PBMC of people with COVID-19, and from healthy individuals using publicly available single cell RNA sequencing data. CD16+ monocytes from people with COVID-19 compared to those from healthy individuals expressed transcriptional changes indicative of increased cell activation, and induction of a migratory phenotype. We also analyzed COVID-19 cases based on severity of the disease and found that mild cases were characterized by upregulation of interferon response and MHC class II related genes, whereas the severe cases had dysregulated expression of mitochondrial and antigen presentation genes, and upregulated inflammatory, cell movement, and apoptotic gene signatures. These results suggest that CD16+ monocytes in people with COVID-19 contribute to a dysregulated host response characterized by decreased antigen presentation, and an elevated inflammatory response with increased monocytic infiltration into tissues. Our results show that there are transcriptomic changes in CD16+ monocytes that may impact the functions of these cells, contributing to the pathogenesis and severity of COVID-19.


Assuntos
COVID-19/virologia , Monócitos/virologia , Receptores de IgG/metabolismo , SARS-CoV-2/patogenicidade , Transcrição Gênica , Transcriptoma , Adulto , Idoso , Proteínas Reguladoras de Apoptose/genética , Proteínas Reguladoras de Apoptose/metabolismo , COVID-19/genética , COVID-19/imunologia , COVID-19/metabolismo , Estudos de Casos e Controles , Citocinas/genética , Citocinas/metabolismo , Feminino , Proteínas Ligadas por GPI/metabolismo , Perfilação da Expressão Gênica , Interações Hospedeiro-Patógeno , Humanos , Mediadores da Inflamação/metabolismo , Fatores Reguladores de Interferon/genética , Fatores Reguladores de Interferon/metabolismo , Masculino , Pessoa de Meia-Idade , Proteínas Mitocondriais/genética , Proteínas Mitocondriais/metabolismo , Monócitos/imunologia , Monócitos/metabolismo , RNA-Seq , SARS-CoV-2/imunologia , Índice de Gravidade de Doença , Análise de Célula Única , Adulto Jovem
10.
Sci Rep ; 10(1): 5009, 2020 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-32193467

RESUMO

DNA copy number aberrations (CNA) are frequently observed in colorectal cancers (CRC). There is an urgent need for CNA-based biomarkers in clinics,. n For Stage III CRC, if combined with imaging or pathologic evidence, these markers promise more precise care. We conducted this Stage III specific biomarker discovery with a cohort of 134 CRCs, and with a newly developed high-efficiency CNA profiling protocol. Specifically, we developed the profiling protocol for tumor-normal matched tissue samples based on low-coverage clinical whole-genome sequencing (WGS). We demonstrated the protocol's accuracy and robustness by a systematic benchmark with microarray, high-coverage whole-exome and -genome approaches, where the low-coverage WGS-derived CNA segments were highly accordant (PCC >0.95) with those derived from microarray, and they were substantially less variable if compared to exome-derived segments. A lasso-based model and multivariate cox regression analysis identified a chromosome 17p loss, containing the TP53 tumor suppressor gene, that was significantly associated with reduced survival (P = 0.0139, HR = 1.688, 95% CI = [1.112-2.562]), which was validated by an independent cohort of 187 Stage III CRCs. In summary, this low-coverage WGS protocol has high sensitivity, high resolution and low cost and the identified 17p-loss is an effective poor prognosis marker for Stage III patients.


Assuntos
Biomarcadores Tumorais , Neoplasias Colorretais/genética , Neoplasias Colorretais/mortalidade , Variações do Número de Cópias de DNA/genética , Deleção de Genes , Proteína Supressora de Tumor p53/genética , Sequenciamento Completo do Genoma/métodos , Adulto , Idoso , Idoso de 80 Anos ou mais , Deleção Cromossômica , Cromossomos Humanos Par 17/genética , Feminino , Marcadores Genéticos , Humanos , Masculino , Pessoa de Meia-Idade , Estadiamento de Neoplasias , Prognóstico , Síndrome de Smith-Magenis/diagnóstico , Síndrome de Smith-Magenis/genética , Taxa de Sobrevida , Adulto Jovem
11.
Synth Syst Biotechnol ; 4(3): 150-156, 2019 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-31508512

RESUMO

Alignment-based database search and sequence comparison are commonly used to detect horizontal gene transfer (HGT). However, with the rapid increase of sequencing depth, hundreds of thousands of contigs are routinely assembled from metagenomics studies, which challenges alignment-based HGT analysis by overwhelming the known reference sequences. Detecting HGT by k-mer statistics thus becomes an attractive alternative. These alignment-free statistics have been demonstrated in high performance and efficiency in whole-genome and transcriptome comparisons. To adapt k-mer statistics for HGT detection, we developed two aggregative statistics T s u m S and T s u m * , which subsample metagenome contigs by their representative regions, and summarize the regional D 2 S and D 2 * metrics by their upper bounds. We systematically studied the aggregative statistics' power at different k-mer size using simulations. Our analysis showed that, in general, the power of T s u m S and T s u m * increases with sequencing coverage, and reaches a maximum power >80% at k = 6, with 5% Type-I error and the coverage ratio >0.2x. The statistical power of T s u m S and T s u m * was evaluated with realistic simulations of HGT mechanism, sequencing depth, read length, and base error. We expect these statistics to be useful distance metrics for identifying HGT in metagenomic studies.

12.
PeerJ ; 7: e7315, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31392094

RESUMO

The human gut microbiota plays a major role in maintaining human health and was recently recognized as a promising target for disease prevention and treatment. Many diseases are traceable to microbiota dysbiosis, implicating altered gut microbial ecosystems, or, in many cases, disrupted microbial enzymes carrying out essential physio-biochemical reactions. Thus, the changes of essential microbial enzyme levels may predict human disorders. With the rapid development of high-throughput sequencing technologies, metagenomics analysis has emerged as an important method to explore the microbial communities in the human body, as well as their functionalities. In this study, we analyzed 156 gut metagenomics samples from patients with colorectal cancer (CRC) and adenoma, as well as that from healthy controls. We estimated the abundance of microbial enzymes using the HMP Unified Metabolic Analysis Network method and identified the differentially abundant enzymes between CRCs and controls. We constructed enzymatic association networks using the extended local similarity analysis algorithm. We identified CRC-associated enzymic changes by analyzing the topological features of the enzymatic association networks, including the clustering coefficient, the betweenness centrality, and the closeness centrality of network nodes. The network topology of enzymatic association network exhibited a difference between the healthy and the CRC environments. The ABC (ATP binding cassette) transporter and small subunit ribosomal protein S19 enzymes, had the highest clustering coefficient in the healthy enzymatic networks. In contrast, the Adenosylhomocysteinase enzyme had the highest clustering coefficient in the CRC enzymatic networks. These enzymic and metabolic differences may serve as risk predictors for CRCs and are worthy of further research.

13.
Nucleic Acids Res ; 47(19): e115, 2019 11 04.
Artigo em Inglês | MEDLINE | ID: mdl-31350896

RESUMO

The human genome is composed of two haplotypes, otherwise called diplotypes, which denote phased polymorphisms and structural variations (SVs) that are derived from both parents. Diplotypes place genetic variants in the context of cis-related variants from a diploid genome. As a result, they provide valuable information about hereditary transmission, context of SV, regulation of gene expression and other features which are informative for understanding human genetics. Successful diplotyping with short read whole genome sequencing generally requires either a large population or parent-child trio samples. To overcome these limitations, we developed a targeted sequencing method for generating megabase (Mb)-scale haplotypes with short reads. One selects specific 0.1-0.2 Mb high molecular weight DNA targets with custom-designed Cas9-guide RNA complexes followed by sequencing with barcoded linked reads. To test this approach, we designed three assays, targeting the BRCA1 gene, the entire 4-Mb major histocompatibility complex locus and 18 well-characterized SVs, respectively. Using an integrated alignment- and assembly-based approach, we generated comprehensive variant diplotypes spanning the entirety of the targeted loci and characterized SVs with exact breakpoints. Our results were comparable in quality to long read sequencing.


Assuntos
Genoma Humano/genética , Variação Estrutural do Genoma/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento Completo do Genoma/métodos , Diploide , Regulação da Expressão Gênica/genética , Estudos de Associação Genética/métodos , Haplótipos/genética , Humanos , Análise de Sequência de DNA/métodos
14.
Front Microbiol ; 10: 826, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31068913

RESUMO

Colorectal cancer (CRC) is the third most common cancer worldwide. Its incidence is still increasing, and the mortality rate is high. New therapeutic and prognostic strategies are urgently needed. It became increasingly recognized that the gut microbiota composition differs significantly between healthy people and CRC patients. Thus, identifying the difference between gut microbiota of the healthy people and CRC patients is fundamental to understand these microbes' functional roles in the development of CRC. We studied the microbial community structure of a CRC metagenomic dataset of 156 patients and healthy controls, and analyzed the diversity, differentially abundant bacteria, and co-occurrence networks. We applied a modified zero-inflated lognormal (ZIL) model for estimating the relative abundance. We found that the abundance of genera: Anaerostipes, Bilophila, Catenibacterium, Coprococcus, Desulfovibrio, Flavonifractor, Porphyromonas, Pseudoflavonifractor, and Weissella was significantly different between the healthy and CRC groups. We also found that bacteria such as Streptococcus, Parvimonas, Collinsella, and Citrobacter were uniquely co-occurring within the CRC patients. In addition, we found that the microbial diversity of healthy controls is significantly higher than that of the CRC patients, which indicated a significant negative correlation between gut microbiota diversity and the stage of CRC. Collectively, our results strengthened the view that individual microbes as well as the overall structure of gut microbiota were co-evolving with CRC.

15.
BMC Genomics ; 20(Suppl 2): 185, 2019 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-30967122

RESUMO

BACKGROUND: Discovering the key microbial species and environmental factors of microbial community and characterizing their relationships with other members are critical to ecosystem studies. The microbial co-occurrence patterns across a variety of environmental settings have been extensively characterized. However, previous studies were limited by their restriction toward pairwise relationships, while there was ample evidence of third-party mediated co-occurrence in microbial communities. METHODS: We implemented and applied the triplet-based liquid association analysis in combination with the local similarity analysis procedure to microbial ecology data. We developed an intuitive scheme to visualize those complex triplet associations along with pairwise correlations. Using a time series from the marine microbial ecosystem as example, we identified pairs of operational taxonomic units (OTUs) where the strength of their associations appeared to relate to the values of a third "mediator" variable. These "mediator" variables appear to modulate the associations between pairs of bacteria. RESULTS: Using this analysis, we were able to assess the OTUs' ability to regulate its functional partners in the community, typically not manifested in the pairwise correlation patterns. For example, we identified Flavobacteria as a multifaceted player in the marine microbial ecosystem, and its clades were involved in mediating other OTU pairs. By contrast, SAR11 clades were not active mediators of the community, despite being abundant and highly correlated with other OTUs. Our results suggested that Flavobacteria are more likely to respond to situations where particles and unusual sources of dissolved organic material are prevalent, such as after a plankton bloom. On the other hand, SAR11s are oligotrophic chemoheterotrophs with inflexible metabolisms, and their relationships with other organisms may be less governed by environmental or biological factors. CONCLUSIONS: By integrating liquid association with local similarity analysis to explore the mediated co-varying dynamics, we presented a novel perspective and a useful toolkit to analyze and interpret time series data from microbial community. Our augmented association network analysis is thus more representative of the true underlying dynamic structure of the microbial community. The analytic software in this study was implemented as new functionalities of the ELSA (Extended local similarity analysis) tool, which is available for free download ( http://bitbucket.org/charade/elsa ).


Assuntos
Algoritmos , Bactérias/classificação , Biologia Computacional/métodos , Metagenoma , Interações Microbianas , Microbiota , Software , Bactérias/genética , Biodiversidade
16.
Genes (Basel) ; 10(3)2019 03 14.
Artigo em Inglês | MEDLINE | ID: mdl-30875820

RESUMO

The increasing availability of large-scale time series data allows the inference of microbial community dynamics by association network analysis. However, correlation-based association network analyses are noninformative of causal, mediating and time-dependent relationships between microbial community functional factors. To address this insufficiency, we introduced the Granger causality model to the analysis of a recent marine microbial time series dataset. We systematically constructed a directed acyclic network, representing both internal and external causal relationships among the microbial and environmental factors. We further optimized the network by removing false causal associations using the conditional Granger causality. The final network was visualized as a Granger graph, which was analyzed to identify causal relationships driven by key functional operators in the environment, such as Gammaproteobacteria, which was Granger caused by total organic nitrogen and primary production (p < 0.05 and Q < 0.05).


Assuntos
Gammaproteobacteria/fisiologia , Interações Microbianas , Modelos Teóricos , Microbiota , Água do Mar/microbiologia
17.
Genes (Basel) ; 10(2)2019 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-30717284

RESUMO

The imbalance of human gut microbiota has been associated with colorectal cancer. In recent years, metagenomics research has provided a large amount of scientific data enabling us to study the dedicated roles of gut microbes in the onset and progression of cancer. We removed unrelated and redundant features during feature selection by mutual information. We then trained a random forest classifier on a large metagenomics dataset of colorectal cancer patients and healthy people assembled from published reports and extracted and analysed the information from the learned decision trees. We identified key microbial species associated with colorectal cancers. These microbes included Porphyromonasasaccharolytica, Peptostreptococcusstomatis, Fusobacterium,Parvimonas sp., Streptococcusvestibularis and Flavonifractorplautii. We obtained the optimal splitting abundance thresholds for these species to distinguish between healthy and colorectal cancer samples. This extracted consensus decision tree may be applied to the diagnosis of colorectal cancers.


Assuntos
Algoritmos , Neoplasias Colorretais/microbiologia , Microbioma Gastrointestinal , Metagenoma , Firmicutes/genética , Firmicutes/isolamento & purificação , Fusobacterium/genética , Fusobacterium/isolamento & purificação , Humanos , Porphyromonas/genética , Porphyromonas/isolamento & purificação , Análise de Sequência de DNA/métodos
18.
Genes (Basel) ; 9(6)2018 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-29925824

RESUMO

With the rapid development of high-throughput sequencing technology, the analysis of metagenomic sequencing data and the accurate and efficient estimation of relative microbial abundance have become important ways to explore the microbial composition and function of microbes. In addition, the accuracy and efficiency of the relative microbial abundance estimation are closely related to the algorithm and the selection of the reference sequence for sequence alignment. We introduced the microbial core genome as the reference sequence for potential microbes in a metagenomic sample, and we constructed a finite mixture and latent Dirichlet models and used the Gibbs sampling algorithm to estimate the relative abundance of microorganisms. The simulation results showed that our approach can improve the efficiency while maintaining high accuracy and is more suitable for high-throughput metagenomic data. The new approach was implemented in our CoreProbe package which provides a pipeline for an accurate and efficient estimation of the relative abundance of microbes in a community. This tool is available free of charge from the CoreProbe's website: Access the Docker image with the following instruction: sudo docker pull panhongfei/coreprobe:1.0.

19.
Nucleic Acids Res ; 46(4): e19, 2018 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-29186506

RESUMO

Large genomic rearrangements involve inversions, deletions and other structural changes that span Megabase segments of the human genome. This category of genetic aberration is the cause of many hereditary genetic disorders and contributes to pathogenesis of diseases like cancer. We developed a new algorithm called ZoomX for analysing barcode-linked sequence reads-these sequences can be traced to individual high molecular weight DNA molecules (>50 kb). To generate barcode linked sequence reads, we employ a library preparation technology (10X Genomics) that uses droplets to partition and barcode DNA molecules. Using linked read data from whole genome sequencing, we identify large genomic rearrangements, typically greater than 200kb, even when they are only present in low allelic fractions. Our algorithm uses a Poisson scan statistic to identify genomic rearrangement junctions, determine counts of junction-spanning molecules and calculate a Fisher's exact test for determining statistical significance for somatic aberrations. Utilizing a well-characterized human genome, we benchmarked this approach to accurately identify large rearrangement. Subsequently, we demonstrated that our algorithm identifies somatic rearrangements when present in lower allelic fractions as occurs in tumors. We characterized a set of complex cancer rearrangements with multiple classes of structural aberrations and with possible roles in oncogenesis.


Assuntos
Variação Estrutural do Genoma , Neoplasias/genética , Sequenciamento Completo do Genoma/métodos , Algoritmos , Aberrações Cromossômicas , Neoplasias Gastrointestinais/genética , Genoma Humano , Humanos
20.
Nucleic Acids Res ; 45(19): e162, 2017 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-28977555

RESUMO

Genomic instability is a frequently occurring feature of cancer that involves large-scale structural alterations. These somatic changes in chromosome structure include duplication of entire chromosome arms and aneuploidy where chromosomes are duplicated beyond normal diploid content. However, the accurate determination of aneuploidy events in cancer genomes is a challenge. Recent advances in sequencing technology allow the characterization of haplotypes that extend megabases along the human genome using high molecular weight (HMW) DNA. For this study, we employed a library preparation method in which sequence reads have barcodes linked to single HMW DNA molecules. Barcode-linked reads are used to generate extended haplotypes on the order of megabases. We developed a method that leverages haplotypes to identify chromosomal segmental alterations in cancer and uses this information to join haplotypes together, thus extending the range of phased variants. With this approach, we identified mega-haplotypes that encompass entire chromosome arms. We characterized the chromosomal arm changes and aneuploidy events in a manner that offers similar information as a traditional karyotype but with the benefit of DNA sequence resolution. We applied this approach to characterize aneuploidy and chromosomal alterations from a series of primary colorectal cancers.


Assuntos
Aneuploidia , Haplótipos , Neoplasias/genética , Aberrações Cromossômicas , Neoplasias Colorretais/diagnóstico , Neoplasias Colorretais/genética , Análise Mutacional de DNA/métodos , Genoma Humano/genética , Instabilidade Genômica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Cariótipo , Cariotipagem/métodos , Neoplasias/diagnóstico , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA